Glycosyltransferase 8 domain-containing protein 1 (GLT8D1) is a UDP-dependent galactosyltransferase

Glycosyltransferases (GTs) are enzymes that catalyze the formation of glycosidic bonds and hundreds of GTs have been identified so far in humans. Glycosyltransferase 8 domain-containing protein 1 (GLT8D1) has been associated with central nervous system diseases and cancer. However, evidence on its enzymatic properties, including its substrates, has been scarcely described. In this paper, we have produced and purified recombinant secretory GLT8D1. The enzyme was found to be N-glycosylated. Differential scanning fluorimetry was employed to analyze the stabilization of GLT8D1 by Mn2+ and nucleotides, revealing UDP as the most stabilizing nucleotide scaffold. GLT8D1 displayed glycosyltransferase activity from UDP-galactose onto N-acetylgalactosamine but with a low efficiency. Modeling of the structure revealed similarities with other GT-A fold enzymes in CAZy family GT8 and glycosyltransferases in other families with galactosyl-, glucosyl-, and xylosyltransferase activities, each with retaining catalytic mechanisms. Our study provides novel structural and functional insights into the properties of GLT8D1 with implications in pathological processes.

Glycosylation is associated with tissue and cell homeostasis and it is dysregulated in many diseases, such as cancer or neurological diseases 4 .Alterations in the glycosylation biosynthetic machinery as consequence of mutations, expression level or localization of GTs (EC 2.4.x.y), glycosyl hydrolases, and nucleotide sugar transporters lead to dysregulated glycosylation.Indeed, mutations in glycosylation-associated genes, including GTs, are responsible for a group of rare diseases designated as congenital disorders of glycosylation, most of them affecting the central and peripheral nervous system 5 .
GTs are involved in the biosynthesis of glycoconjugates.Notably, they consist of enzymes that transfer glycosyl residues from activated sugar donors (nucleotide diphosphate sugars, nucleotide monophosphate sugars and sugar phosphates) onto nucleophilic acceptors, with the formation of glycosidic bonds.For O-GTs that use nucleotide diphosphate sugars as donors the reaction scheme is 6 : During the formation of the glycosidic bond the anomeric configuration (alpha/beta) of the product can either be conserved or inverted relative to the donor substrate, and based on this, GTs are classified as either retaining or inverting enzymes, respectively 7 .GTs have been organized in families in the CAZy database (www.cazy.org) based on sequence and structural differences, and at present 116 families of GTs have been identified (as of November 6, 2023).3-D structure analysis of the catalytic domains of GTs allowed their classification into three groups GT-A, GT-B, and GT-C based on their fold, in addition to some additional rarer fold types 8 .GT-A enzymes have one Rossman-like fold that binds nucleotides and, in general, enzymes that require metal ion for activity present a DxD sequence motif that coordinates a divalent metal ion and the nucleotide sugar.The GT-B group contains two Rossman-like folds and does not employ a metal ion for catalysis.GT-C GTs contain several transmembrane domains and use lipid-linked sugar donors, as found in the oligosaccharyltransferase catalytic subunit.
Recent results supported the biological importance of glycosyltransferase 8 domain-containing protein 1 (GLT8D1).Based on exome sequencing in an autosomal-dominant amyotrophic lateral sclerosis pedigree, mutations in the GLT8D1 gene were associated with the neurodegenerative disease amyotrophic lateral sclerosis 9 .The GLT8D1 gene was also found associated with psychiatric disorders such as schizophrenia 10 .Its expression was decreased in the hippocampus of schizophrenia patients and evidence indicated its involvement in the pathophysiology of the disease 11 .GLT8D1 is also associated with cancer: a mutation in GLT8D1 was found in soft tissue tumors 12 ; it was proposed as prognostic marker for melanoma 13 and gastric cancer where it correlated with tumor immunity markers 14 ; it promoted human glioblastoma cell migration 15 .More recently, evidence supported a role of GLT8D1 in neurotrophin signaling within membrane lipid rafts 16 .
GLT8D1 is a type II membrane glycoprotein of 41.9 kDa (371 amino acids, UniProtKB-Q68CQ7) with three potential N-glycosylation sites at N103, N249 and N257.GLT8D1 is composed of a short cytoplasmic domain (amino acids 1-7), a transmembrane domain (amino acids 8-28) that anchors the protein to the membrane and a luminal part (amino acids 29-371) that includes a short stem region and a catalytic domain.GLT8D1 is localized in the trans-Golgi network of HEK293 and N2A cells 9 and in the Golgi and endoplasmic reticulum of NCH601 cells 15 .
GLT8D1 belongs to the GT8 family of the CAZy database 17 , which includes the glycogenin glucosyltransferases (GYG1 and GYG2), the EGF domain-specific xylosyltransferases (GXYLT1, GXYLT2, XXYLT1), and the xylosyltransferase domain of the bifunctional matriglycan co-polymerase, LARGE1 and LARGE2 in Homo sapiens and α3-galactosyltransferase, α4-galactosyltransferase, and α-glucuronosyltransferase activities in other species.GXYLT1 and GXYLT2 were formerly known as GLT8D3 and GLT8D4, respectively; on the other hand, in mammals GLT8D1 has a closely related sequence homolog of unknown function (GLT8D2) 18 .In contrast to the glucosyl-or xylosyltransferase activity of the other mammalian GT8 enzymes, hydrolysis of UDP-galactose 9,15,19 and UDP-glucose 15 was reported for GLT8D1.So, the information concerning GLT8D1 activity and substrate specificity is still scarce.
In this work we have explored several sugar nucleotide donors and potential acceptors of recombinant GLT8D1 using differential scanning fluorimetry (DSF), a luminescent glycosyltransferase assay, HPAEC-PAD and LC-MS/MS aiming at characterizing its activity and specificity.We also examined an AlphaFold generated 3D model of GLT8D1, which provided structural insights into the protein fold, comparison with other related enzyme structures, and location of disease associated mutations.

Production of recombinant soluble GLT8D1
The GLT8D1 lumenal catalytic domain (residues 52-371) was expressed in HEK293 cells with N-terminal fusion sequences encoding a signal sequence, His 8 tag, AviTag, GFP and TEV protease cleavage site as previously described 2 (Fig. 1A).The recombinant secreted GLT8D1 fusion protein has an expected mass of 67,624 Da.The enzyme was purified from the culture supernatant by affinity using a Ni 2+ -NTA column with a final polishing step by gel filtration on a Superdex 75 column (Fig. S1).Purified GLT8D1 migrated as a single band in SDS-PAGE slightly below 75 kDa standard (Fig. 1B).Recovery was 3 mg of purified protein from one liter of cell supernatant.
GLT8D1 has three potential N-glycosylation sites.Digestion with peptide N-glycosidase F (PNGase F), which cleaves complex, oligomannose and hybrid N-linked glycans, caused a clear downward shift in SDS-PAGE (Fig. 1C).By contrast, with endoglycosidase H (Endo H), which cleaves only oligomannose and some hybrid glycans, there was no shift in SDS-PAGE mobility indicating that recombinant GLT8D1 is N-glycosylated predominantly with complex N-glycans.

Screening of suitable ligands for GLT8D1 by differential scanning fluorimetry
Thermal denaturation of purified GLT8D1 in fusion with green fluorescent protein (GFP) monitored by intrinsic tryptophan fluorescence-based differential scanning fluorimetry (ITF-DSF) yielded a major transition with a melting temperature (T m ) of 38.4 ± 0.1 °C (Fig. 2A) and a second minor transition assigned to GFP with a T m ≈ 85 °C (data not shown).Incubation with Mn 2+ induced a T m shift of + 4.5 °C, in line with the expected stabilization afforded by Mn 2+ .From the tested 'scaffold' nucleotides, UDP caused the highest stabilization (ΔT m = + 11.2 °C with respect to Mn 2+ ), followed by GDP (ΔT m = + 7.7 °C).CMP resulted in a modest stabilization (ΔT m = + 1.3 °C) with respect to Mn 2+ .UDP-based donors afforded lower degrees of stabilization than their UDP 'scaffold' (Fig. 2B).While UDP-GlcA resulted in a ΔT m = + 4.9 °C with respect to Mn 2+ , all other UDP-based donors produced ΔT m values ≈ 2 °C with respect to Mn 2+ .None of the acceptors (GalNAc and GlcNAc) appeared to stabilize GLT8D1 in the presence or absence of UDP-Gal and/or Mn 2+ (Fig. 2C).

GLT8D1 enzymatic activity
Since the UDP nucleotide led to the highest GLT8D1 stabilization, the hydrolytic activity of the enzyme towards a panel of UDP sugar donors was investigated.Indeed, many GTs also exhibit hydrolysis activity toward their sugar nucleotide donors with water acting as an acceptor in the reaction.A panel of nine UDP monosaccharides was used in GLT8D1 assays in the absence or presence of the divalent cation Mn 2+ .As a positive control for hydrolysis, wild-type B4GalT1 and the B4GalT-Y289L mutant with additional GalNAc-transferase activity 20 were also employed.Hydrolysis of most substrates tested with the exception of UPD-Glc and UDP-GalNAc was observed in the presence of Mn 2+ (Fig. S2A) but the putative activity was very low, approximately 50 to 150-fold lower than the positive control with B4GalT1 (Fig. S2B).These results indicate that whereas the hydrolase activity may reflect the sugar nucleotide specificity of the enzyme as it was detected only in the presence of Mn 2+ , that activity is not efficient or selective since several donor nucleotide sugars were hydrolyzed.
Aiming at identifying potential GLT8D1 acceptors several combinations of donors and acceptors were tested using the UPD-Glo assay.With UDP-Gal as donor some transferase reaction was detected towards GalNAc and GlcNAc whereas the values with biotinylated N-acetylglucosamineβ-OCH 2 CH 2 NH 2 , Xyl, Glc or Gal were comparable to those of the control without enzyme (Fig. 3A).On the other hand, when UDP-GalNAc, UDP-GlcNAc or UDP-GlcA were tested as potential donors combined with GalNAc or GlcNAc as potential acceptors the release of UDP was comparable to that of the control without enzyme.
Potential products of Gal transfer from UDP-Gal onto GalNAc catalyzed by GLT8D1 were screened for by LC-MS/MS analysis.A peak at m/z value of 384.15 consistent with the composition HexHexNAc was detected (Fig. 3B).The extracted ion chromatogram also displayed the presence of a molecule compatible with HexHexNAc only detected when enzyme, donor and acceptor were present (Fig. 3C).In agreement a minor peak was also detected by HPAEC-PAD analysis, but the intensity was too low to allow quantification.The corresponding MS 2 spectrum displayed fragments compatible with lacto-N-biose I and β-d-Galp-(1-3)-d-GalpNAc with FISh  www.nature.com/scientificreports/coverage matching of 77% (Fig. 3D; Fig. S3).Moreover, a matching of 70% with T-antigen and core 8 (α-d-Galp-(1-3)-α-d-GalpNAc) was calculated (Fig. S3).Aiming at elucidating linkage anomericity, NMR analysis was performed but the product was not detectable by this technique.Using GlcNAc as a substrate a peak at m/z value of 384.15 compatible with the composition HexHexNAc was also detected (Fig. S4A).The MS 2 spectrum of the product showed a 98.2% match with that of standard N-acetyllactosamine (Galβ4GlcNAc, LacNAc) analyzed in the same conditions (Fig. S4B).In agreement, a 97.4% match with LacNAc was obtained using mzCloud search.
A number of other compounds (Tables S1, S2) were assessed but found not to react as acceptors (see Figs. S5, S6 and related discussion).

Structural modeling of GLT8D1
Prior studies have investigated the modeling of the GLT8D1 protein structure and residues potentially involved in substrate interactions 15 .Recent developments of structural modeling using AlphaFold2 21 provided the generation of an alternative model for GLT8D1 here that allows comparisons with other known related GTs (Fig. 4).As expected for a member of the GT8 family, the AlphaFold2 model of GLT8D1 contained the landmark features of a GT-A fold enzyme, including a DxD motif, G-loop, and C-terminal His residue in the sugar nucleotide donor binding site (Figs.4A,B and 5) 22 .The fourth landmark feature of GT-A fold inverting enzymes (the 'xED' motif that harbors the catalytic base) was comprised of residues I283-T284-T285, not consistent with its use as a catalytic base or inverting catalytic mechanism, but consistent with the predicted retaining catalytic mechanism for GT8 enzymes 6,22 .
Comparison of the GLT8D1 AlphaFold2 model with other known protein structures using Dali 27 allowed the identification of several glycosyltransferases from CAZy GT8 (Table S3, Fig. 5) which use: UDP-Gal as sugar donor (bacterial WbbM 23 and GlyE 24 ); UDP-Xyl as donor (human LARGE-1 25 and XXYLT1 29 ); UDP-Glc as donor (glycogenin 30 ); or a bacterial enzyme of unknown specificity (PDB 3TZT).Additional related structures were identified in the Dali analysis: GT24 fungal UGGT 31 that uses UDP-Glc as donor; GT6 human ABO 32 that uses UDP-Gal/GalNAc as donor; and two enzymes from GT34 that employ UDP-Glc 33 or UDP-Xyl 34 as donors.These results are also consistent with a broad collection of GT8 subfamily members, as well as members of GT24,   GT34, GT6 and GT88 that all partition into a common clade (Clade 9) and employ a retaining glycosyltransferase catalytic mechanism 22 .www.nature.com/scientificreports/While most of the structures related to GLT8D1 in the Dali search contain bound UDP or divalent metal, few contain an intact sugar nucleotide donor (only glycogenin 30 and ABO 35 ).Preliminary modeling of UDP-Gal and Mn 2+ in the active site of GLT8D1 could be approximated using alignment with related enzymes with bound donor substrates, including the UDP and Mn 2+ complex of the closely related S. pneumoniae GlyE 24 (PDB 5GVV) and the structure of UDP-Gal from B4GALT1 26 (PDB 2FYC) (Fig. 4).While this model approximates the position of the bound donor complex for GLT8D1 similar to prior modeling studies 15 , it also allows a comparison of similarly positioned residues in the other related enzymes (Fig. 5).Residues facing the sugar residue of the donor in the GLT8D1 model include R154, D171, G244, N242, and T284 (Fig. 4A,B).Of these, D171 and G244 are within the DxD motif and G-loop, respectively, and are highly conserved among the 10 related enzymes (Fig. 5).The equivalent of R154 was a Lys or Arg residues in all related enzymes except the xylosyltransferase, XXYLT1, where a Ser is found in that position.The equivalent of N242 was an Asn or Gln for all related enzymes except for the glucosyltransferase, UGGT, that contains an Ile and the human Gal/GalNAc transferase, ABO, that contains a Gly in the equivalent positions.The equivalent of T284 is generally an Asp or Asn for the related enzymes except for the glucosyltransferase, glycogenin (substituted with a Gly), and Chlorella glucosyltransferase (substituted with a Glu).Thus, there is no apparent correlation between the donor specificity of the respective GLT8D1-related enzymes and the residues that face toward the sugar in the UDP-sugar donor binding subsites.

Discussion
In this study, we investigated catalytic and structural properties of recombinant soluble glycosyltransferase GLT8D1.Our results indicated that the enzyme has preference for UDP nucleotide sugar donors and shows galactosyltransferase activity preferentially towards monosaccharide GalNAc as acceptor.A 3D model of GLT8D1 pinpointed structural features of GT-A fold glycosyltransferases such as the divalent metal and nucleotide donor binding sites with features consistent with a predicted retaining catalytic mechanism.
Recombinant GLT8D1 was produced as a truncated soluble secreted catalytic domain fusion protein in HEK293 cells as previously described for the expression and purification of mammalian glycosyltransferases 2 .The secreted product was N-glycosylated based on SDS-PAGE mobility shift following PNGase F digestion as predicted by the presence of 3 N-glycan acceptor sequons.Prior studies had identified a glycan structure on N249 of GLT8D1 from human liver using a combination of enzyme digestion and hydrazide chemistry 36 .
Differential scanning fluorimetry using ITF-DSF showed a high sensitivity binding of Mn 2+ and putative donors to GLT8D1 likely due to the fact that the enzyme has 8 tryptophan residues, 3 of which are quite close to the active site.Stabilization by Mn 2+ agrees with a GT-A fold structure for GLT8D1 and also in line with the hydrolysis results in the absence or presence of the cation.Nucleotide stabilization of GLT8D1 was greatest with UDP, followed by GDP, while CMP had almost no effect, suggesting that UDP sugars were preferred donor substrates.However, among all the UDP-monosaccharides tested, UDP-GlcA caused the highest GLT8D1 stabilization, which did not correspond to a higher hydrolytic activity in our assays; curiously, some members of the GT8 CAZy family have α-glucuronosyltransferase activity.Overall, ITF-DSF has utility to investigate potential substrates for glycosyltransferases.In agreement, others had already reported on the use of conventional DSF with SYPRO Orange fluorescent dye to study the binding of sugar nucleotides to bacterial glycosyltransferases 37 .
Many GTs are capable of hydrolyzing their sugar nucleotide donors in the absence of any acceptor, which is particularly useful to characterize activities when the acceptor substrates are unknown 38,39 .Here, GLT8D1 appeared to hydrolyze UDP-Gal, UDP-Xyl, UDP-GalNAc, UDP-GlcA, UDP-GalA, UDP-Arap and UDP-Araf in the presence of Mn 2+ , but the activity values were low compared with controls of well-described glycosyltransferases B4GALT1 and B4GALT1-Y289L.Some GTs are acceptor-dependent enzymes, such as, fucosyltransferase 7 39 , which could be the case of GLT8D1, and only assays in the presence of the proper acceptor substrate would demonstrate activity.Other reports from the literature showed UDP-Gal hydrolysis by fulllength GLT8D1 9,15 , and also UDP-Glc hydrolysis 15 but the activity values also appear low.It is not known how the presence of the cytoplasmic/transmembrane/stem region from the full-length form, absent from our recombinant secretory form, could affect enzyme specificity.For example, secretory fucosyltransferases 3 and 5 had increased activity toward glycoproteins but decreased activity with glycosphingolipids, compared to the full-length enzymes 40,41 .
GLT8D1 galactosyltransferase activity was investigated using UDP-Gal as donor with several acceptors.We found that the enzyme catalyzed the transfer in vitro of Gal onto GalNAc with the formation of the disaccharide HexHexNAc as detected by LC-MS.The product was consistent with Galβ3GalNAc (lacto-N-biose I and T-antigen or core 1 O-GalNAc glycan) and Galα3GalNAc (core 8 O-GalNAc glycan) as concluded from MS/MS analysis.Since members of the CAZy GT8 family have been shown to employ retaining catalytic mechanism 22 , it is anticipated that the most likely enzymatic product would be Galα3GalNAc.Similarly, a disaccharide compatible with LacNAc structure (HexHexNAc) was detected by MS with GlcNAc as an acceptor.However, no activity was detected towards truncated N-glycans with terminal GlcNAc possibly due to the low concentration used and because such compounds are not efficient substrates.
In view of the low galactosyltransferase reaction efficiency observed with GalNAc as acceptor, it was concluded that this monosaccharide is not the ideal GLT8D1 substrate.However, glycans of higher complexity from carrier molecules in specific structural settings may be better substrates.Potential candidates could be glycolipids, which are known to be relevant in signaling events associated with neurodegeneration 42 and cancer 43 .In this context, recent evidence indicated that GLT8D1 was relevant in ganglioside biosynthesis 16 .Glycoproteins could also constitute potential substrates; for example, the T-antigen and related structures are present in mucins with important roles in cancer 44 and sialylated T-antigen is abundantly expressed in brain tissue glycoproteins [45][46][47] , but the enzymes involved in their biosynthesis have been well-studied.On the other hand, core 8 is a low abundance structure in glycoproteins from brain tissue 46,48 with unknown functional relevance, and GLT8D1 appears to have www.nature.com/scientificreports/ the capacity to synthesize that structure.In view of the several open possibilities, it is necessary to investigate further to find efficient endogenous GLT8D1 substrates, to structurally characterize them and evaluate their role in the mechanisms underlying diseases of the central nervous system and cancer.Prior efforts at modeling the GLT8D1 structure revealed a GT-A fold for the enzyme 49 , including conserved landmark features 15 , and the absence of an xED motif sequence that harbors the catalytic base in inverting enzymes that is consistent with the retaining mechanism for the GT8 family enzymes 22 .We employed an AlphaFold2 model of GLT8D1 to query structure databases and identify well-characterized enzymes with high similarity to GLT8D1.Numerous GT8 proteins were identified as well as structures from GT24, GT6, and GT34 as anticipated from prior informatic analyses that clusters these families into a broader Clade 9 22 .Among the mammalian GT8 enzymes, GLT8D1 has closest sequence homology to GLT8D2, a protein of unknown function, while other mammalian members of CAZy GT8 include the glycogenins (GYG1 and GYG2), members of a xylosyltransferase family of enzymes that modify EGF-like domains (GXYLT1, GXYLT2, XXYLT1) and the xylosyltransferase domains of LARGE1 and LARGE2, all of which are more distantly related to GLT8D1/ GLT8D2.
Several GLT8D1 mutations have been identified in ALS patients including G78W, I70T, A82E, I87N, R92C 9 and I290M 9 ; decreased galactosyltransferase activity was reported for G78W, R92C and I290M mutant.On the other hand, the Q319E substitution was found in soft tissue tumors 12 .Each of these mutations were located quite distant from the putative enzyme active site (Fig. 6) and likely either result in enzyme destabilization or possibly alteration of interactions with other proteins.
In conclusion, we have obtained purified, secreted form of GLT8D1 suitable for enzymatic and structural studies.Our results support that GLT8D1 exhibits low galactosyltransferase activity, but more studies are required to unveil efficient substrates, particularly endogenous substrates that would clarify the functional role of GLT8D1.GLT8D1 was modeled as a GT-A fold structure with conserved active site features in comparison with other related enzyme structures, and disease-associated mutations were mapped on the structural model, but will require future study to identify the structural basis of pathogenesis.

Enzyme production and purification
A fusion protein expression construct encoding the catalytic domain of human GLT8D1 (Uniprot Q68CQ7, residues 52-371) was generated by PCR amplification of the GLT8D1 coding region from a Mammalian Gene Collection clone followed by Gateway BP recombination into the pDONR221 vector and subsequent transfer to the pGEn2 expression vector by Gateway LR recombination 2 .The resulting GLT8D1-pGEn2 expression construct encodes a 25-amino acid signal sequence, an His 8 tag, AviTag, the "superfolder" GFP coding region, and the 7-amino acid recognition sequence of the tobacco etch virus (TEV) protease followed by the human GLT8D1 catalytic domain.The recombinant human GLT8D1 fusion protein was expressed as a soluble secreted product by transient transfection of suspension culture HEK293-F cells (FreeStyle™ 293-Fcells, Thermo Fisher Scientific, Waltham MA) as previously described [50][51][52] .The recombinant protein was purified using Ni-NTA Superflow (Qiagen, Valencia, CA) chromatography as previously described 2 and was concentrated to approximately 3 mg/mL using an ultrafiltration pressure cell (Millipore, Billerica, MA) with a 10-kDa molecular mass cutoff membrane.The recombinant human GLT8D1 was further purified by a Superdex 75 chromatography (GE www.nature.com/scientificreports/Healthcare) preconditioned with a buffer containing 20 mM HEPES, 150 mM NaCl, 0.05% sodium azide, pH 7.0.Peak fractions were pooled, concentrated to 1 mg/mL and buffer exchanged with 20 mM HEPES, 100 mM NaCl, 0.05% sodium azide, pH 7.0, 10% glycerol.The final protein preparation was aliquoted and stored at − 80 °C until use.

Immunoblotting analysis and protein deglycosylation
Proteins were analyzed by SDS-PAGE and transferred to polyvinyledene fluoride membranes that were blocked for 1 h with 5% skimmed dry milk (Nestle) in Tris-buffered saline with 0.1% Tween-20.Primary and secondary antibodies were mouse anti-His Tag IgG1 monoclonal (1:7000; GenScript A00186) and sheep anti-mouse IgG coupled to HRP (1:3000; Amersham, GE Healthcare Europe GmbH, NA934).Washings were with TBST.Detection was performed with the Immobilon Western Chemiluminescent HRP substrate (Millipore).GLT8D1 was digested with peptide N-glycosidase F (PNGase F, Roche) and endoglycosidase H (Endo H, Roche), as previously described 53 , and further analyzed by immunoblotting.

Differential scanning fluorimetry
ITF-DSF assays were performed in a Prometheus NT48 (NanoTemper Technologies GmbH, Munich, Germany).GLT8D1 at 350 µg/mL, in 50 mM TrisHCl pH 7.5, was pre-incubated with 0.1 mM of nucleotides or putative donors (UDP, GDP, CMP, UDP-Gal, UDP-Glc, UDP-Xyl, UDP-GlcA, UDP-GalNAc, UDP-GlcNAc) and/or 2 mM of putative acceptors (GalNAc and GlcNAc), in the absence or presence of 1 mM MnCl 2 , and transferred into 10 μL optical capillaries (NanoTemper).A linear temperature gradient (1 °C/min) between 20 and 90 °C was applied, and protein unfolding was monitored by recording the protein intrinsic tryptophan fluorescence emission at 330 nm and 350 nm (fluorescence excitation at 275 nm).Data in triplicates from the ratio of fluorescence emission at 330 nm and 350 nm were fitted with GraphPad 6, employing a sigmoidal (variable slope) curve and the melting temperatures (T m ) were retrieved from the inflexion points.
Nucleotide sugar hydrolysis assays were performed in 20 μL reactions containing 200 μM UDP-sugar substrates and 10 μL (3 μg) of GLT8D1 enzyme containing 10 mM MnCl 2 , 1 mg/mL bovine serum albumin, 25 mM HEPES, 300 mM NaCl, pH 7.5.Reactions were carried out for 24 h at 37 °C and the resulting UDP was quantified using a UDP-Glo Assay (Promega Corporation, V6971) and compared to a UDP standard curve to calculate the hydrolysis activity.
HPAEC-PAD analysis of N-glycans was performed using an ICS-3000 ion chromatography system (Dionex Corporation), consisting of an AS autosampler, a DC detector/chromatography module and a DP dual pump essentially as described before 53 .Twenty microlitre samples were injected into a CarboPac PA200 column (3 × 250 mm, Dionex).Released oligosaccharides were eluted with a gradient of 0.075 M sodium hydroxide (solvent A) and 0.075 M sodium hydroxide/0.6M sodium acetate (solvent B), at a flow rate of 0.4 mL/min.The gradient consisted of a 3 min isocratic run at 100% solvent A, followed by consecutive increases of solvent B concentration (7 min to 0.7%, 15 min to 5%, 11 min to 16%, 9 min to 30%, 2 min to 100%) and a final step of 13 min at 100% B. The column temperature was at 30 °C.The electrochemical detection of the oligosaccharides was performed by application of the detection potentials and durations as recommended by the manufacturer.
Raw data generated in the untargeted analysis were processed using Compound Discoverer 3.2 (version 3.2.0.421,Thermo Scientific) for metabolite identification (including unsupervised peak detection).The identification searches were performed against the mzCloud MS/MS database, the KEGG and ChEBI databases (using the ChemSpider search node) and a Mass list with the molecules of interest (containing molecular formulas).A mass tolerance of 3 ppm was considered for all searches.The matches with the Mass list were further elucidated using the MS/MS data acquired and the results from the ChemSpider search, by performing FISh (Fragment Ion Search) scoring.This algorithm compares the experimental fragmentation spectra for a compound to the expected fragmentation spectra based on the structure of the compound, giving it a score from 0 to 100.

GLT8D1 structural models and comparisons
The AlphaFold2 structural model for GLT8D1 (AF-Q68CQ7-F1) was used to identify closest structural homologs in the Protein Data Bank (PDB) using Dali 27 .The top ten structures with similarity to the GLT8D1 AlphaFold model were used for structure-based protein sequence alignment using PROMALS3D 28 .Protein structures were aligned and displayed using Pymol (Schrödinger, L., & DeLano, W. (2020).PyMOL.Retrieved from http:// www.pymol.org/ pymol).

Figure 1 .
Figure 1.Recombinant secretory GLT8D1.(A) Schematic representation of the secretory GLT8D1 composed of signal sequence, His, Avi and GFP tags, TEV cleavage site and catalytic domain-containing sequence of GLT8D1.(B) SDS-PAGE analysis of purified GLT8D1 stained with Coomassie Blue.The amount of protein was 3.5 µg.(C) Deglycosylation of GLT8D1.GLT8D1 without treatment (1), or incubated with PNGase F buffer (2), PNGase F (3), Endo H buffer (4) and Endo H (5) is shown.Detection was done by the enhanced chemiluminescent method.The amount of protein applied per lane was 200 ng.Original figures (B) and (C) are presented in Fig. S7.

Figure 2 .
Figure 2. Screening of potential GLT8D1 acceptors and donors by differential scanning fluorimetry.Thermal denaturation of GLT8D1 in the presence or absence of 1 mM Mn 2+ and/or 0.1 mM putative donors and/ or 2 mM putative acceptors, monitored by intrinsic tryptophan fluorescence-based differential scanning fluorimetry.(A) First derivative of thermal denaturation curves obtained from the ratio of fluorescence emission at 330 nm and at 350 nm as a function of temperature (data in triplicates shown as individual curves): as isolated GLT8D1 (Apo; light blue); GLT8D1 incubated with Mn 2+ in the absence (Apo + Mn 2+ ; cyan) or presence of putative donors CMP (orange), GDP (turquoise) and UDP (dark blue).(B) Melting temperature (T m ) values obtained by fitting the data from the ratio of fluorescence emission at 330 nm and 350 nm with a sigmoidal (variable slope) curve (data represented the average of triplicates ± standard deviation).ΔT m values calculated by subtracting the T m of GLT8D1 incubated with Mn 2+ (Apo + Mn 2+ ) to the T m of GLT8D1 incubated with Mn 2+ and putative donors ('scaffold' nucleotides and UDP-based donors).(C) Absent effect of acceptors on the T m of GLT8D1 in the absence/presence of Mn 2+ and UDP-Gal as donor.

Figure 3 .
Figure 3. GLT8D1 activity with UDP-Gal as donor.(A) The assay was done as described in M&M using 0.5 mM nucleotide sugar donor, 20 mM monosaccharide, 0.8 µg GLT8D1, for 22 h.Ctr, without enzyme.Assay was done in triplicate and standard deviation is shown.(B) Peak at m/z 384.15 corresponding to the composition HexHexNAc is shown.(C) MS 2 spectrum of compound at m/z 384.15 (see Fig. S3 for fragment identification).(D) Extracted ion chromatograms (XICs) of m/z 384.1500, which corresponds to the [M + H] + ion of the C 14 H 25 NO 11 compound.A 5 ppm mass tolerance was used.

Figure 4 .Figure 5 .
Figure 4. Comparison of the AlphaFold model of GLT8D1 with other related GT8 enzymes.The GT-A fold catalytic domain from the GLT8D1 AlphaFold model (AF-Q68CQ7-F1, A and B) was aligned with the related GT8 enzymes, WbbM (panels C and D, PDB 6U4B 23 ), GlyE (panels E and F, PDB 5GVV 24 ), and the xyloxyltransferase domain of human LARGE-1 (panels G and H, PDB 7UI7 25 ).Panels A, C, E and G are displayed as cartoon representations with gray surface and stick representations for the DxD, G-loop and C-terminal His motifs shown in yellow and bound divalent cation as gray sphere.Panels B, D, F and H are zoomed-in representations of the donor binding site for the respective enzymes.For the zoomed-in representations, a bound UDP-Gal and Mn 2+ were modeled in the active site (gray stick representation) based on the UDP and Mn 2+ complex of GlyE and the structure of UDP-Gal from B4GALT1 26 (PDB 2FYC).Putative interacting residues were identified in the respective complexes as residues facing the sugar residue of the donor.Yellow sticks are displayed for the DxD motif (yellow labels) and C-terminal His (green labels).The G-loop motif is displayed in magenta and the xED motif residues are displayed in cyan sticks in each panel.Other interacting residues are indicated by thin stick representation with residue labeling.

Figure 6 .
Figure 6.Mutations related to ALS or soft tissue tumors.Residues in the GLT8D1 catalytic domain associated ALS (G78W, A82E, I87N and R92C, I290M) or soft tissue tumors (Q319E) are displayed as magenta spheres within the cartoon representation of GLT8D1.The position of bound UDP-Gal and Mn 2+ modeled in the GLT8D1 active site are shown in gray stick and slate sphere representation, respectively as shown in Fig. 4B.